Segmentation of Persian Cursive Words Using Basic Shapes

Authors

  • Koorosh Samimi Daryoush
  • Maryam Khademi
  • Alireza Nikookar
  • Aida Farahani
Abstract

Segmentation is the process of dividing cursive words into smaller parts in order to reduce complexity and increase the accuracy of handwriting recognition. However, it is a complicated and time-consuming task. In this paper, we introduce the concept of basic shapes and explore its application to the segmentation of Persian words. Given a set of predefined shapes (lines and open or closed curves) extracted from the Persian alphabet, our approach uses those shapes with a decision tree technique to divide a cursive word into segments through a less complicated process. Experimental results showed 98.83% accuracy in segmenting Persian words.

Similar resources

A New Approach to Segmentation of Persian Cursive Script based on Adjustment the Fragments

Optical Character Recognition (OCR) is a long-standing topic of great interest in the pattern recognition field. The recognition of cursive scripts such as Persian and Arabic is a difficult task, as their segmentation suffers from serious problems in different languages. Segmentation is a process of dividing cursive words into smaller parts in order to decrease complexity and increase accuracy of re...

A New Segmentation Algorithm for Online Handwritten Word Recognition in Persian Script

The cursive nature of the Persian alphabet and the complex, convoluted rules of this script pose major challenges to the segmentation as well as the recognition of Persian words. We propose a new segmentation algorithm for the main stroke of online Persian handwritten words. Using this segmentation, we present a perturbation method which is used to generate artificial samples from handwritten w...

Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach

The presence of a large number of unique shapes called ligatures in cursive languages, along with variations due to scaling, orientation and location provides one of the most challenging pattern recognition problems. Recognition of the large number of ligatures is often a complicated task in oriental languages such as Pashto, Urdu, Persian and Arabic. Research on cursive script recognition ofte...

A Dynamic Programming Method for Segmentation of Online Cursive Uyghur Handwritten Words into Basic Recognizable Units

Correct and efficient segmentation of Uyghur words into characters is crucial to successful recognition. However, little work has been done in this area. Cursive Uyghur handwriting contains many connected characters, which makes the segmentation and recognition of Uyghur words very difficult. To enable large vocabulary Uyghur word recognition using character models, we propose a charact...

A Robust Free Size OCR for Omni-Font Persian/Arabic Printed Document Using Combined MLP/SVM

Optical character recognition of cursive scripts presents a number of challenging problems in both the segmentation and recognition processes, which attracts much research in the field of machine learning. This paper presents a novel approach based on a combination of MLP and SVM to design a trainable OCR for Persian/Arabic cursive documents. The implementation results on a comprehensive databas...

Publication date: 2012